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Abstract. The current study examined the utility and incremental validity of parent 
ratings on the Strengths and Difficulties Questionnaire and Disruptive Behavior 
Disorders rating scale completed at kindergarten registration in identifying risk 
status as defined by important criterion variables (teacher ratings, daily behavioral 
performance, and quarterly grades). The participants were 252 kindergarten 
students from one school district. Receiver operating characteristic analyses and 
area-under-the-curve values indicated that most subscales had low to moderate 
utility in identifying children showing at-risk academic performance and social, 
emotional, and behavioral problems. However, forward linear regression analyses 
indicated that parent ratings provided incremental validity relative to the aca¬ 
demic screening tool used by the school district, accounting for an addi¬ 
tional 3.69% to 22.37% of the variance in kindergarten outcomes. Implications for 
the use of parent ratings in universal screening for social, emotional, and behav¬ 
ioral problems at kindergarten entry are discussed. 


Epidemiologic data indicate that 15% to tal health problems and few of these children 
20% of children ages 6 to 17 years have men- receive adequate care (Kataoka, Zhang, & 
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Wells, 2002). Many children who are at risk of 
mental health problems manifest social (e.g., 
difficulty getting along with others), emotional 
(e.g., anxious, despondent), or behavioral 
(e.g., inattention, hyperactivity-impulsivity, 
aggression) problems (or a combination 
thereof) in early childhood (Pihlakoski et al., 
2006; Wakschlag et al., 2007). Left unad¬ 
dressed, social, emotional, and behavioral 
(SEB) problems place children at risk of 
school dropout, substance use, criminal behav¬ 
ior, and physical and mental health problems 
in adulthood and are costly for the education, 
healthcare, and justice systems (see National 
Research Council and Institute of Medicine, 
2009, for review). 

One way to prevent negative outcomes 
associated with SEB problems is to provide 
early intervention in schools or through com¬ 
munity agencies. Indeed, several randomized 
trials document the benefits of systematic, 
high-quality education and intervention in 
early childhood for children’s cognitive, lan¬ 
guage, social-emotional, and behavioral de¬ 
velopment (Conduct Problems Prevention 
Group & Dodge, 2007; Ramey & Ramey, 
2004). Thus, there is a need for tools to iden¬ 
tify children who may benefit from such early 
intervention. Although there is strong support 
for the utility of teacher ratings in screening 
for SEB problems (Feeney-Kettler, Kratoch- 
will, Kaiser, Hemmeter, & Kettler, 2010; Har¬ 
rison, Vannest, & Reynolds, 2013), teacher 
ratings are only available after the child has 
acclimated to the school environment and 
started to demonstrate failure in social, behav¬ 
ioral, or academic domains. 

Identification of children with SEB 
problems should be done as early as possible 
to help children avoid early failure experi¬ 
ences and to maximize efficiency of interven¬ 
tion efforts. Kindergarten registration is an 
opportune time to systematically screen chil¬ 
dren for SEB problems via parent ratings. 
Studies examining the psychometric proper¬ 
ties of parent-rated screening measures in 
young samples are emerging (Feeney-Kettler 
et al., 2010; Harrison et al., 2013). However, 
in most studies the criterion variable that is 
predicted is another rating scale rather than an 


outcome that is important to parents, teachers, 
or administrators (e.g., grades, severe behav¬ 
ioral infractions) and on which educational 
decisions (e.g., grade retention, suspension) 
are made. Given that parent ratings could 
identify risk status prior to the child experi¬ 
encing school failure, research is needed to 
determine if parent ratings at kindergarten reg¬ 
istration predict indicators of school success 
that have high ecological validity. 

Our university-community partnership 
examined (a) the utility of two publicly avail¬ 
able parent rating scales that assess child SEB 
problems, completed at kindergarten registra¬ 
tion, in identifying risk status on teacher rat¬ 
ings, as well as criterion variables of priority 
to school district personnel (i.e., grades, daily 
classroom behavior), and (b) the incremental 
validity of parent ratings in predicting those 
criterion variables beyond that predicted by 
the school district’s academic screening tool 
used at kindergarten registration. To our 
knowledge, no studies have examined the in¬ 
cremental validity of parent ratings over other 
school-based tools used to detect risk. 

SEB PROBLEMS AFFECT SCHOOL 
SUCCESS 

There is substantial evidence that SEB 
problems can be reliably and accurately de¬ 
tected at an early age (Keenan & Wakschlag, 
2004; Mathiesen & Sanson, 2000) and that 
such problems are associated with negative 
outcomes in multiple domains of functioning, 
including school performance (see Suldo, 
Gormley, DuPaul, & Anderson-Butcher, 2014, 
for review). For example, as early as pre¬ 
school, inattention, hyperactivity, low-energy, 
social-reticence, and withdrawn behaviors ob¬ 
served by teachers at the beginning of the 
school year predict social and academic diffi¬ 
culties at the end of the year (Fantuzzo, Bu- 
lotsky, McDermott, Mosca, & Lutz, 2003). 
Teacher-rated social and self-regulation skills 
assessed in kindergarten are related to growth 
in academic skills through sixth grade. For 
example, children with lower levels of social 
and self-regulation skills start behind their 
peers academically in kindergarten and fall 
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further behind by second grade (McClelland, 
Acock, & Morrison, 2006). Furthermore, there 
is evidence that SEB problems persist into 
later childhood (Pihlakoski et al., 2006) and 
young adulthood (Kessler et al., 2005), affect¬ 
ing important school outcomes such as grade 
retention and graduation rates (Loe & Feld¬ 
man, 2007). Together, these studies document 
significant negative academic outcomes asso¬ 
ciated with SEB problems in young children, 
in both the short and long term. Given the 
importance of school success to future em¬ 
ployment and economic status (U.S. Depart¬ 
ment of Labor, 2013), identifying and address¬ 
ing problems that interfere with academic suc¬ 
cess are important to students, families, 
schools, and society. However, children with 
SEB problems often go undetected in schools 
(Walker, Nishioka, Zeller, Severson, & Feil, 
2000 ). 

CURRENT PROCEDURES FOR 
IDENTIFYING SEB PROBLEMS IN 
STUDENTS 

Several strategies can be used to identify 
children with SEB problems, including ob¬ 
servational methods, teacher referral, and uni¬ 
versally administered teacher ratings. Each 
method has advantages and disadvantages. 
Systematic direct observation of student be¬ 
havior by an external observer is least suscep¬ 
tible to bias and captures the dynamic between 
student behavior and context (Landau & 
Swerdlik, 2005). However, observations re¬ 
quire extensive personnel resources and mul¬ 
tiple observations to obtain an adequate sam¬ 
ple of behavior (Briesch, Chafouleas, & 
Riley-Tillman, 2010; Volpe, McConaughy, & 
Hintze, 2009). 

Teachers are useful reporters of aberrant 
student behavior (e.g., Racz, King, Wu, Wit- 
kiewitz, & McMahon, 2013); thus, school dis¬ 
tricts may rely on teacher referrals to trigger 
assessment or service provision to students. 
However, individual teachers have different 
thresholds for behaviors warranting a referral 
for services, and a sizable minority may not 
refer children because they believe that it is 
not part of their role or they feel uncomfort¬ 


able or unqualified to do so (Reinke, Stormont, 
Herman, Puri, & Goel, 2011). Thus, relying on 
teacher referral (rather than universally admin¬ 
istered teacher ratings) can result in unde¬ 
tected and unaddressed risk in children. In¬ 
deed, studies show that 50% to 60% of chil¬ 
dren in first through sixth grade identified as at 
risk of SEB problems via universal teacher 
ratings were not referred for services by their 
teachers (Eklund & Dowdy, 2013; Eklund et 
al., 2009). Teacher referral has less predictive 
utility in identifying academic risk than uni¬ 
versally completed teacher ratings (VanDer- 
Heyden, Witt, & Naquin, 2003). Together, 
these data support seeking alternative or com¬ 
plementary strategies beyond teacher referral 
to identify at-risk students. 

ADVANTAGES AND 
DISADVANTAGES OF UNIVERSAL 
SCHOOL-BASED SCREENING 

There are many benefits of universal 
school-based screening for SEB problems. 
First, universal screening (by either parent or 
teacher) ensures that all children are evalu¬ 
ated, reducing the likelihood of undetected 
risk. Second, screening results can identify 
target areas for early intervention and provide 
a baseline against which intervention efforts 
can be compared (at the child, classroom, or 
school level). Third, early detection, when 
connected with early intervention, interrupts 
the pathway between early risk factors and 
negative long-term outcomes (Conduct Prob¬ 
lems Prevention Group & Dodge, 2007; 
Walker et al., 2009; Webster-Stratton, Rinaldi, 
& Reid, 2010). Thus, parent ratings obtained 
prior to school entry may be particularly im¬ 
portant because teacher ratings are only avail¬ 
able after the child has shown some level of 
social, behavioral, or academic failure. Fur¬ 
thermore, there is evidence that successful 
early intervention can produce up to a $2 
million savings per youth treated (estimates 
take into account the costs of high school 
dropout and a career of adult criminal behav¬ 
ior and substance use, all of which are linked 
to juvenile antisocial behavior; Cohen, 1998). 
Although nearly 60% of children in the United 
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States attend preschool or receive center-based 
care (U.S. Census Bureau, 2013), systematic 
screenings with teacher report may not be 
feasible given the diversity in infrastructure 
across such settings. 

Kindergarten registration represents a 
natural opportunity to systematically obtain 
ratings of child behavior and functioning from 
all parents. However, there are also challenges 
to using universal screening, particularly with 
young children. It can be difficult to identify 
risk status during a period in which variation 
in behavior is developmentally normative. 
False-positive cases could create unnecessary 
distress among parents and unnecessary finan¬ 
cial expenditures for school districts, depend¬ 
ing on follow-up actions taken. Similarly, 
screening tools have imperfect psychometric 
properties; thus, some students may still go 
undetected and unserved (false negatives). 
Lastly, screening may only be helpful to the 
extent that resources can meet the needs indi¬ 
cated by the screening results. Given the large 
number of unidentified and untreated youth on 
the one hand and the potential for possible 
negative outcomes on the other hand, research 
is needed to identify valid and reliable screen¬ 
ing tools and to formulate data-driven guide¬ 
lines for universal screening procedures. 

PARENT-BASED SCREENING TOOLS 

There is strong support for the utility of 
teacher ratings in screening for SEB problems 
in preschool and early elementary school stu¬ 
dents (Feeney-Kettler et ah, 2010; Harrison et 
ah, 2013). Teacher ratings in kindergarten are 
correlated with student grades (rs ranged from 
-.40 to -.20), test scores (rs ranged from -.30 
to -.29), and discipline infractions (rs ranged 
from .37 to .57) 2 years later (DiStefano & 
Kamphaus, 2007). However, data supporting 
the predictive utility of parent-rated screening 
tools are limited. Several articles have sum¬ 
marized the psychometric properties of the 
parent versions of the Behavioral and Emo¬ 
tional Screening System (BESS; Kamphaus & 
Reynolds, 2007); the Ages and Stages 
Questionnaires: Social-Emotional (ASQ-SE; 
Squires, Bricker, & Twombly, 2003); the Pre¬ 


school Behavior Screening System (PBSS; 
Feeney-Kettler, Kratochwill, & Kettler, 2011); 
and the Strengths and Difficulties Question¬ 
naire (SDQ; Goodman, 2001). The reviews 
document that these measures have acceptable 
internal and test-retest reliability and concur¬ 
rent validity (Feeney-Kettler et ah, 2010; Har¬ 
rison et ah, 2013). Preliminary diagnostic util¬ 
ity statistics are also available for all mea¬ 
sures. However, in most studies the criterion 
variable used to calculate the diagnostic utility 
statistics was another rating scale completed 
by the same rater (e.g., parent BESS with 
parent Behavior Assessment System for Chil- 
dren-Second Edition [BASC-2] from which 
the BESS was developed; Phase 1 of the par¬ 
ent PBSS with Phase 2 of the parent PBSS). 
This common-rater and common-method ap¬ 
proach is an important context in which to 
interpret the data because both could contrib¬ 
ute to inflated associations between measures 
(which are used as evidence for concurrent 
validity). In addition, these outcomes do not 
indicate whether parent measures predict eco¬ 
logically valid indicators of risk (e.g., grades, 
discipline referrals). 

Although psychologists prioritize rating 
scales and diagnoses, educators and parents 
often do not. Instead, parents and educators 
often judge school success by the child’s 
grades and how well the child follows school 
rules and gets along with teachers and peers on 
a daily basis. Thus, in addition to using teacher 
rating scales as a criterion, we selected indi¬ 
cators of school success with high ecological 
validity (Suldo et ah, 2014). Namely, student 
grades (i.e., grade point average [GPA]) and 
daily social and behavioral functioning ac¬ 
cording to the daily level achieved in the 
school-wide positive behavior support 
(SW-PBS) program (e.g., green, yellow, or red 
level) were considered critical outcomes. If 
research on universal screening is to make an 
impact on school policy, we need to know 
how well the screening tool predicts the indi¬ 
cators that are most important to consumers 
(i.e., parents, teachers, administrators) and that 
are used to make educational decisions. For 
example, it is unlikely that a child would be 
retained or suspended because of a score on a 
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rating scale. However, a child is very likely to 
be retained because of low grades or to be 
suspended because of an act of disrespect, 
defiance, or aggression, which would be re¬ 
flected by the child’s daily behavioral level 
achieved in an SW-PBS program. 

CURRENT STUDY 

To identify risk status prior to the onset 
of school failure experiences, we need to un¬ 
derstand the utility of parent ratings, com¬ 
pleted before the start of school, in identifying 
students at risk of academic and SEB prob¬ 
lems (because teacher ratings are not available 
at this time). We sought to identify a free, 
publicly available, psychometrically sound 
parent-rated measure that could help school 
personnel screen for SEB problems at kinder¬ 
garten registration and throughout the kinder¬ 
garten year. Interestingly, most of the parent¬ 
rated screening measures in the literature did 
not fit the aforementioned characteristics (e.g., 
the BESS is not free; the PBSS is not publicly 
available; the ASQ-SE has an upper age limit 
that prohibits ongoing assessment). In this 
context, the team selected the SDQ because it 
is free, publicly available, and psychometri¬ 
cally sound and assesses a broad array of 
SEB problems, including peer problems, 
prosocial behavior, emotional problems, be¬ 
havioral problems, and hyperactivity-inatten¬ 
tion. As a comparison, we selected another 
parent rating scale that also met the aforemen¬ 
tioned criteria but offered depth in the assess¬ 
ment of disruptive behavioral problems be¬ 
cause these problems are predictive of future 
negative academic and social outcomes for 
children (e.g., Dodge, Greenberg, Malone, & 
Conduct Problems Prevention Research 
Group, 2008; Moilanen, Shaw, & Maxwell, 
2010) and are associated with significant 
teacher stress, negative teacher-student inter¬ 
actions, and even workforce turnover (Greene, 
Beszterczey, Katzzenstein, Park, & Goring, 
2002; Ingersoll, 2001). Namely, the Disrup¬ 
tive Behavior Disorders (DBD) rating scale 
(Pelham, Gnagy, Greenslade, & Milich, 1992) 
assesses the presence and frequency of the 
clinical symptoms of inattention, hyperactivi¬ 


ty-impulsivity, and oppositional defiant be¬ 
havior. Although there is overlap between the 
SDQ and DBD in the constructs assessed, it 
was important to determine the relative utility 
of depth versus breadth of assessment do¬ 
mains. The DBD has nine items for inattention 
and nine items for hyperactivity-impulsivity; 
the SDQ has five items on the hyperactivity- 
inattention subscale. If the shorter subscale of 
the SDQ is as predictive of outcomes as either 
longer subscale, then the measure with fewer 
items is more feasible to administer. 

Our primary aims were to examine (a) 
the utility of two publicly available parent 
rating scales that assess child SEB problems in 
identifying risk status as defined by the teacher 
BESS ratings, quarterly grades, and daily be¬ 
havioral performance in the SW-PBS program 
and (b) the incremental validity of parent rat¬ 
ings in predicting those criterion variables be¬ 
yond that predicted by the existing school 
district academic screening tool. This study 
advances the literature by examining the util¬ 
ity and incremental validity of parent ratings, 
completed at kindergarten registration, in re¬ 
lation to multiple ecologically valid kindergar¬ 
ten outcomes. 

METHOD 

The parents of 273 kindergarteners 
(94% of all kindergarten students enrolled in 
the district), along with their teachers, con¬ 
sented to participate. Of the kindergarten stu¬ 
dents, 252 (92.3%) had sufficient rating scale 
data to be included in the analyses (21 had 
entire rating scales missing because families 
moved either prior to the start of school or 
before the end of Quarter 1). Those included 
in the analyses did not differ from those ex¬ 
cluded regarding age, sex, race, school build¬ 
ing assignment, mother’s or father’s highest 
level of education, current receipt of mental 
health services, or problem severity on the 
parent screening measures. See Table 1 for 
sample characteristics. Most respondents were 
mothers (85.7%), and the remainder were fa¬ 
thers (8.7%), grandmothers (3.6%), or other 
caregivers (2%). The data in Table 1 indicate 
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Table 1. Demographic 
Characteristics 


Variable 

n (%) or 

M ( SD) 

Sex (male) 

127 

(50.4%) 

Race (White) 

239 

(94.8%) 

Mother’s highest education level 



No high school diploma 

21 

(8.3%) 

High school diploma 

75 

(29.8%) 

Partial college or associate degree 

116 

(46.1%) 

Bachelor’s degree 

22 

(8.7%) 

Graduate degree 

11 

(4.4%) 

Father’s highest education level 



No high school diploma 

23 

(9.1%) 

High school diploma 

111 

(44.0%) 

Partial college or associate degree 

72 

(28.5%) 

Bachelor’s degree 

14 

(5.6%) 

Graduate degree 

3 

(1.2%) 

Attended preschool 

193 

(76.6%) 

Age, years 

4.87 

(0.41) 

SDQ subscales 



Emotional problems 

1.42 

(1.76) 

Behavioral problems 

1.37 

(1.66) 

Peer problems 

1.18 

(1.40) 

Hyperactivity-inattention 

3.53 

(2.51) 

Prosocial behavior 

8.70 

(1.94) 

DBD 



Inattention 

0.50 

(0.54) 

Hyperactivity-impulsivity 

0.69 

(0.57) 

Oppositional defiant behavior 

0.42 

(0.48) 

BESS total t score 

46.13 

(10.27) 

KRA-L 

19.57 

(6.16) 

% green days a 

91.80 

(13.59) 

GPA 



Qi 

3.53 

(0.70) 

Q4 

3.67 

(0.61) 

Note. N = 252. For some categories, percentages do not 
sum to 100% because of missing data. BESS = Behav¬ 
ioral and Emotional Screening System; DBD = Disrup¬ 
tive Behavior Disorders Rating Scale; GPA = grade point 
average; KRA-L = Kindergarten Readiness Assessment- 
Literacy; Q = quarter; SDQ = Strengths and Difficulties 
Questionnaire. 

Percentage of green days across kindergarten year. 


that the sample is representative of the com¬ 
munities in the Appalachian region of Ohio 
from which it was drawn (U.S. Census Bu¬ 
reau, 2006). All 12 teachers were female and 
White. 


Measures 

Parents completed a demographic ques¬ 
tionnaire to document child sex, age, race or 
ethnicity; family moves in the past year; pre¬ 
school experience and other socialization op¬ 
portunities; and parental education and em¬ 
ployment. Parents also completed two addi¬ 
tional rating measures, as described in the 
following sections. 

DBD Rating Scale 

The DBD (Pelham et al., 1992) is a 
45-item rating scale that assesses the presence 
of symptoms of inattention, hyperactivity-im- 
pulsivity, oppositional defiant behavior, and 
conduct disorder behavior, as listed in the 
fourth and fifth editions of the Diagnostic 
and Statistical Manual of Mental Disorders 
(American Psychiatric Association, 2000, 
2013) for children in preschool through high 
school. Parents rated each item on a 4-point 
scale ranging from 0 ( not at all present) to 3 
{very much present). For this study, conduct 
disorder items were omitted owing to low base 
rates of these items in this age group. Internal 
consistency (a > .85) and 4-week test-retest 
stability (>.75) were adequate for research, 
and construct validity of the inattention and 
hyperactivity-impulsivity scales has been 
demonstrated through significant correlations 
with similar subscales (DuPaul, Power, Mc- 
Goey, Ikeda, & Anastopoulos, 1998). In the 
current sample, internal consistency estimates 
were .92 (inattention), .90 (hyperactivity-im¬ 
pulsivity), and .88 (oppositional defiant behav¬ 
ior). A subscale average score of 1 or greater 
has been used to denote at-risk status on this 
measure (Owens, Johannes, & Karpenko, 
2009; Swanson et al., 2001). By use of this cut 
score, 16.7%, 25.4%, and 11.1% of students 
were showing at-risk levels of inattention, hy¬ 
peractivity-impulsivity, and oppositional defi¬ 
ant behavior, respectively. 

Strengths and Difficulties Questionnaire 

The SDQ (Goodman, 2001) is a 25-item 
questionnaire in which items are rated on a 
3-point scale ranging from 0 {not true) to 2 
{certainly true). We used the parent version 
for children ages 4 to 10 years. Four studies 


26 







Screening for SEB Problems 


(see Stone, Otten, Engels, Vermulst, & Jans¬ 
sen, 2010, for review) confirm the five-sub- 
scale factor structure: emotional problems, be¬ 
havioral problems, peer problems, hyperac¬ 
tivity-inattention, and prosocial behavior. 
Across 26 studies, the weighted average inter¬ 
nal consistency of the parent subscales ranged 
from .53 (peer problems) to .76 (hyperactivi¬ 
ty-inattention) and the stability over 4 to 6 
months ranged from .65 to .71. High scores on 
the parent SDQ are also associated with in¬ 
creased risk of a psychiatric disorder (Good¬ 
man, 2001; Stone et al., 2010). Higher scores 
on the prosocial behavior subscale indicate 
more acceptable behaviors; higher scores on 
all other subscales indicate more problematic 
behaviors. In our sample, internal consistency 
estimates were .70 (emotional problems), .70 
(behavioral problems), .83 (hyperactivity-in¬ 
attention), .48 (peer problems), and .72 
(prosocial behavior). Using the recommended 
cut scores (see www.sdqinfo.org), 16.7% 
(peer problems), 17.9% (hyperactivity-inat¬ 
tention), 18.3% (emotional problems), 20.2% 
(behavioral problems), and 20.6% (prosocial 
behavior) of students were at risk depending 
on the subscale. 

BASC-2 BESS-Teacher Version 

The BESS (K-12 form) is a 26-item 
measure that screens for internalizing, exter¬ 
nalizing, and adaptive behavioral problems in 
children (Kamphaus & Reynolds, 2007). Data 
from a large normative sample show that the 
teacher version total scores have acceptable 
interrater (>.70) and internal reliability 
(>.94) for research, as well as evidence of 
convergent and predictive validity for stan¬ 
dardized achievement test scores and student 
GPAs over 4 years (rs ranged from .34 to .63). 
There is also evidence of strong test-retest 
reliability (>.85, with approximately 40-day 
interval; Feeney-Kettler et al., 2010). In this 
study, BESS total scores were used as a crite¬ 
rion variable. By use of a cut score for age- 
based t score of 61 or higher, 10% of students 
in this sample were at risk. 

Grade Point Average 

A quarterly GPA was calculated using a 
4-point scale (based on participants’ grades in 


reading, spelling, math, science, and social 
studies) and used as a criterion variable. Stu¬ 
dent grades were classified as follows: above 
satisfactory (4.0), satisfactory (3.0), below sat¬ 
isfactory (2.0), needs improvement (1.0), and 
unsatisfactory (0.0). Given that a GPA lower 
than 3.0 was considered below satisfactory, a 
cut score of less than 3.0 was used as the 
threshold to define at-risk status for academic 
problems. Across quarters, 8% to 18% of stu¬ 
dents fell below this cut score. 

Behavioral Functioning Indicator 

All schools in the participating district 
used an SW-PBS framework. Each teacher 
used a behavior wheel that documented the 
students’ daily rule-following behavior. Each 
student began the school day with his or her 
clip on the green segment of the wheel and 
moved his or her clip with each additional rule 
violation, such that the yellow segment repre¬ 
sented a first warning, the orange segment 
represented a second warning (with a possible 
mild consequence), and the red segment rep¬ 
resented a referral to the office or parent no¬ 
tification (or both). This daily status was doc¬ 
umented on the student’s monthly calendar. 
This measure has strong ecological validity 
and is a metric commonly used by school staff 
and parents to describe a child’s day. The 
percentage of green days achieved for each 
student was calculated by dividing the total 
number of green days achieved between Au¬ 
gust and June by the total number of days the 
student attended school in that time. In collab¬ 
oration with teachers, at-risk status was de¬ 
fined as achieving fewer than 80% green days. 
Teachers reported that a typical student could 
be assigned a level lower than green (“off 
green”) one day per week but that two days off 
green raised concerns about behavioral control 
and was often associated with other problems. 
In the sample, 12% of students fell below this 
cut score (sample range was 15%—100% green 
days; 75% of students had >90% green days). 

Kindergarten Readiness Assessment- 
Literacy 

The Kindergarten Readiness Assess¬ 
ment-Literacy (KRA-L) is a 25-item screen- 
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ing tool developed by the Ohio Department of 
Education to assess early literacy skills in 
three areas: oral language, phonological 
awareness, and print awareness. Total scores, 
ranging from 0 to 29, are organized into three 
bands, each with a recommendation to guide 
decisions about the need for additional assess¬ 
ment and instructional programming: 0 to 13 
(assess broadly for intense instruction), 14 
to 23 (assess for additional instruction in tar¬ 
geted areas), and 24 to 29 (assess for enriched 
or advanced instruction). Across the school 
district, the average score was 19.49 
and 22.42% of students fell in the lowest 
band, 46.98% fell in the middle band, 
and 30.60% fell in the highest band. In the 
current sample, the average score was 19.57 
and 21.2% of students fell in the lowest 
band, 47.6% fell in the middle band, 
and 31.2% fell in the highest band. Although 
the KRA-L has been required by the state 
since 2007, few psychometric data are avail¬ 
able (http://education.ohio.gov/Topics/Early- 
Learning/Guidance-About-Kindergarten/ 
KRAL). 

Procedures 

Parents were recruited to participate in 
the project when they registered their child for 
kindergarten (between April and August 
2011). Once parents provided consent, they 
completed the demographic questionnaire, 
DBD, and SDQ. Parents received a small ed¬ 
ucational gift as compensation for their partic¬ 
ipation. The majority of parents completed 
study questionnaires in April and May (78%), 
whereas the remaining parents completed 
study measures in June (1%), July (1%), Au¬ 
gust (16%), and September (4%). The child’s 
KRA-L scores, as well as the summary of 
the child’s screening, were inserted into the 
child’s kindergarten registration file for the 
principal and teacher to use as desired. Teach¬ 
ers provided consent to participate and com¬ 
pleted the teacher BESS for all consented chil¬ 
dren in October. This time frame was chosen 
to allow adequate time for teachers to become 
familiar with their students. Teachers were 


compensated with $80. Indicators of academic 
achievement and school behavior were col¬ 
lected on a regular basis throughout the aca¬ 
demic year (August through June). 

RESULTS 

Descriptive statistics for all predictor 
and criterion variables are presented in Ta¬ 
ble 1. The data were used to examine the 
utility of the parent rating scales identifying 
risk status, as well as the incremental validity 
of parent ratings in predicting criterion vari¬ 
ables beyond that predicted by the existing 
school district academic screening tool. 

Utility in Identifying Risk Status 

Children were first categorized as at risk 
or typically developing for each criterion vari¬ 
able (i.e., BESS teacher rating scale, percent 
green days, GPA for Quarters 1 and 4). Re¬ 
ceiver operating characteristic (ROC) analyses 
were used to assess the utility of each subscale 
of the DBD and SDQ in identifying student 
risk status on each criterion. ROC analyses 
produce an area-under-the-curve (AUC) value 
that represents a ratio of sensitivity and spec¬ 
ificity for identifying at-risk status. AUC val¬ 
ues above .5 indicate that the parent subscale 
score predicts at-risk status on the indicator at 
a rate greater than chance (see Table 2). There 
is variability in the qualitative terms used to 
describe the magnitude of AUCs, without de¬ 
finitive agreement on these terms (e.g., Pintea 
& Moldovan, 2009; Swets, 1996). Thus, we 
describe the results using the following rang¬ 
es: .50 to .70, low to moderate; .70 to .90, 
moderate to strong; and .90 and above, strong. 

Identifying Risk Status Based on Teacher 
Ratings 

In identifying risk status on the BESS, 
the AUCs for the SDQ and DBD subscales 
ranged from .50 to .68 (see Table 2). With the 
exception of the SDQ emotional problems 
subscale (AUC = .50), confidence intervals 
(CIs) indicated that all subscales were similar 
in identifying risk status on the BESS. 
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Identifying Risk Status Based on Daily 
Behavior 

In identifying risk status based on 
SW-PBS green days, the AUCs for the SDQ 
and DBD subscales ranged from .45 to .71 
(see Table 2). Consistent with results for the 
BESS, with the exception of the SDQ emo¬ 
tional problems subscale (AUC = .45), CIs 
indicated that all subscales were similar in 
identifying risk status on this criterion 
variable. 

Identifying Risk Status Based on Grades 

The AUCs for first-quarter and fourth- 
quarter grades ranged from .56 to .69. (The 
AUCs for the second- and third-quarter grades 
all fell within the CIs of those for the first and 
fourth quarters and are not presented because 
of space limitations.) Across both quarters, the 
DBD inattention subscale had the highest 
AUC statistic for identifying risk status based 
on GPA and the SDQ emotional problems 
subscale had the lowest. 

Incremental Validity 

To better understand which subscales 
best predicted the criterion variables, relative 
to each other and beyond the predictive utility 
of the school district’s academic screening 
tool (KRA-L), a forward linear regression 
analysis was conducted for each outcome in¬ 
dicator. To account for the nesting of students 
within teachers, a regression model with ran¬ 
dom intercept was applied. Random effects 
(resulting from students nested within teach¬ 
ers) were not applied to the predictors because 
they were collected before the start of the 
academic year; thus it was reasonable to as¬ 
sume there was no effect of teacher on them. 
A manual forward analysis was conducted as 
follows. In the first step, the KRA-L total was 
entered. In the second step, the best predictor 
(among the three DBD subscales and five 
SDQ subscales) was chosen and entered based 
on the one that provided the best improvement 
in three information criteria: Akaike informa¬ 
tion criterion, corrected Akaike information 
criterion, and Bayesian information criterion. 
The second step was repeated until either no 
improvement in all three information criteria 


could be made or no significant predictors 
could be added. See Table 3 for correlations 
between predictor and criterion variables and 
Table 4 for regression results. 

Predicting Teacher Rating Scores 

In the first step, the KRA-L accounted 
for 20.82% of the variation in the teacher 
BESS scores. In the second step, the SDQ 
behavioral problems subscale offered the most 
improvement in prediction based on the infor¬ 
mation criteria, accounting for an addi¬ 
tional 10.40% of the variation in BESS scores. 
In the third, fourth, and fifth steps, the DBD 
hyperactivity-impulsivity (2.02%), SDQ emo¬ 
tional problems (2.01%), and DBD opposi¬ 
tional defiant behavior (1.36%) subscales each 
accounted for additional variance in BESS 
scores. 

Predicting Daily Behavior 

In the first step, the KRA-L accounted 
for 6.18% of the variation in green days. On 
the basis of the information criteria, the SDQ 
behavioral problems subscale offered the most 
improvement in the second step, accounting 
for an additional 13.99% of variation in green 
days. In the third, fourth, and fifth steps, the 
DBD hyperactivity-impulsivity (2.55%), SDQ 
emotional problems (4.15%), and DBD oppo¬ 
sitional defiant behavior (1.68%) subscales 
each produced significant improvement in 
prediction. 

Predicting Grades 

In the first step, the KRA-L accounted 
for 40.50% and 23.36% of the variation in the 
first- and fourth-quarter GPA, respectively. 
For the first-quarter GPA, in the second and 
third steps, the SDQ peer problems (3.31%) 
and prosocial behavior (1.63%) subscales of¬ 
fered significant improvement in prediction 
based on the information criteria. For the 
fourth-quarter GPA, only the SDQ peer prob¬ 
lems subscale (3.69%) accounted for addi¬ 
tional variance. 

Determining Possible Cut Scores 

Once the subscales with significant in¬ 
cremental validity were identified, diagnostic 
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Table 4. Random Coefficient Regression Results for Parent Ratings 
Predicting Teacher Ratings, Behavior, and GPA 

A R 2 (3 SE 


BESS at-risk status 
Step 1: KRA-L total 
Step 2: SDQ behavioral problems 
Step 3: DBD hyperactivity-impulsivity 
Step 4: SDQ emotional problems 
Step 5: DBD oppositional defiant behavior 
Additional R 2 because of Step 2 to 5 
Total R 2 

80% green for behavior 
Step 1: KRA-L total 
Step 2: SDQ behavioral problems 
Step 3: DBD hyperactivity-impulsivity 
Step 4: SDQ emotional problems 
Step 5: DBD inattention 
Additional R 2 because of Step 2 to 5 
Total R 2 

Q1 GPA 

Step 1: KRA-L total 
Step 2: SDQ peer problems 
Step 3: SDQ prosocial behavior 
Additional R 2 because of Step 2 to 3 
Total R 2 

Q4 GPA 

Step 1: KRA-L total 
Step 2: SDQ peer problems 
Additional R 2 because of Step 2 
Total R 2 


20.82% 

-0.618* * 

0.084 

10.40% 

1.864* 

0.434 

2.02% 

5.148* 

1.251 

2.01% 

-0.741 

0.320 

1.36% 

-3.905 

1.755 

15.79% 

36.61% 

6.18% 

0.004* 

0.001 

13.99% 

-0.025* 

0.005 

2.55% 

-0.110* 

0.025 

4.15% 

0.016* 

0.005 

1.68% 

0.065 

0.026 

22.37% 

28.55% 

40.50% 

0.066* 

0.005 

3.31% 

-0.071* 

0.024 

1.63% 

0.045 

0.018 

4.94% 

45.44% 

23.36% 

0.042* 

0.006 

3.69% 

-0.082* 

0.025 

3.69% 

27.05% 


Note. Depending on variables in model, n ranges from 231 (model with Q4 GPA) to 248 (80% green). DBD = 
Disruptive Behavior Disorders Rating Scale; GPA = grade point average; KRA-L = Kindergarten Readiness Assess¬ 
ment-Literacy; Q = quarter; SDQ = Strengths and Difficulties Questionnaire. 

*p < .01. 


utility statistics (sensitivity, specificity, posi¬ 
tive predictive power, negative predictive 
power) were examined to determine cut scores 
that balanced false positive to false negative 
ratios. Although sensitivity, specificity, and 
positive predictive power rates exceeding .75 
are considered ideal (Glover & Albers, 2007), 
other authors have noted that rates between 
.40 and .60 may be acceptable in the first stage 
of a screening protocol (Kettler & Feeney- 
Kettler, 2011). The cut scores presented in 
Table 5 are those that represent the best bal¬ 
ance of sensitivity and specificity, assuming a 


base rate of 20%, consistent with epidemio¬ 
logic data (Kataoka et al., 2002). 

DISCUSSION 

This study provides preliminary evi¬ 
dence for the utility and incremental validity 
of score inferences derived from parent SDQ 
and DBD rating scales, completed at kinder¬ 
garten registration, in relation to important 
kindergarten outcomes (i.e., teacher ratings, 
grades, and daily behavior reports). In brief, 
the ROC analyses and the diagnostic effi- 
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Table 5. Diagnostic Efficiency Statistics for Parent-Rated Subscales 
Predicting Criterion Variables 



SEN 

SPE 

EFF 

PPV 

NPV 

TP 

FP 

TN 

FN 

SDQ behavioral problems subscale identifying at-risk status on BESS 




Cut score of 2 

.58 

.68 

.67 

.31 

.87 

6.0% 

28.6% 

61.1% 

4.4% 

Cut score of 3 

.46 

.83 

.79 

.40 

.86 

4.8% 

15.5% 

74.2% 

5.6% 

Cut score of 4 

.31 

.93 

.87 

.52 

.84 

3.2% 

6.3% 

83.3% 

7.1% 

DBD hyperactivity-impulsivity subscale identifying at-risk status on BESS 




Cut score of 0.5 

.73 

.49 

.51 

.26 

.88 

7.5% 

46.0% 

43.7% 

2.8% 

Cut score of 1 

.54 

.79 

.77 

.39 

.87 

5.6% 

18.7% 

71.0% 

4.8% 

DBD oppositional defiant behavior subscale identifying at-risk status on BESS 



Cut score of 0.5 

.50 

.67 

.65 

.27 

.84 

5.2% 

29.8% 

60.0% 

5.2% 

Cut score of 1 

.31 

.91 

.85 

.47 

.84 

3.2% 

7.9% 

81.7% 

7.1% 

SDQ emotional problems subscale identifying at-risk status on BESS 




Cut score of 1 

.58 

.39 

.41 

.19 

.79 

6.0% 

54.8% 

34.9% 

4.4% 

Cut score of 2 

.35 

.65 

.62 

.20 

.80 

2.6% 

31.3% 

58.3% 

6.7% 

Cut score of 3 

.23 

.82 

.76 

.25 

.81 

2.4% 

15.9% 

73.8% 

7.9% 

SDQ behavioral problems subscale identifying at-risk status on green days 




Cut score of 2 

.58 

.69 

.68 

.32 

.87 

7.1% 

27.4% 

60.3% 

5.2% 

Cut score of 3 

.45 

.83 

.79 

.40 

.86 

5.6% 

14.7% 

73.0% 

6.7% 

Cut score of 4 

.23 

.92 

.84 

.42 

.83 

2.8% 

6.7% 

81.0% 

9.5% 

DBD hyperactivity-impulsivity subscale identifying at-risk status on green days 



Cut score of 0.5 

.74 

.51 

.54 

.28 

.89 

9.1% 

42.9% 

44.8% 

3.2% 

Cut score of 1 

.48 

.79 

.75 

.37 

.86 

6.0% 

18.3% 

69.4% 

6.3% 

DBD inattention subscale identifying at-risk status on 

green days 





Cut score of 0.5 

.52 

.63 

.62 

.26 

.84 

6.3% 

32.5% 

55.2% 

6.0% 

Cut score of 1 

.32 

.86 

.79 

.36 

.84 

4.0% 

12.7% 

75.0% 

8.3% 

SDQ emotional problems subscale identifying at-risk status on green days 




Cut score of 1 

.52 

.38 

.40 

.17 

.76 

6.3% 

54.4% 

33.3% 

6.0% 

Cut score of 2 

.29 

.64 

.60 

.17 

.78 

3.6% 

31.3% 

56.3% 

8.7% 

Cut score of 3 

.16 

.81 

.73 

.18 

.80 

2.0% 

16.3% 

71.4% 

10.3% 

SDQ peer problems subscale identifying at-risk status 

on Q1 GPA 





Cut score of 1 

.75 

.48 

.58 

.27 

.89 12.3% 

42.7% 

39.5% 

4.4% 

Cut score of 2 

.50 

.71 

.67 

.30 

.85 

8.9% 

23.8% 

58.5% 

8.9% 

Cut score of 3 

.30 

.86 

.76 

.34 

.83 

5.2% 

11.7% 

70.6% 

12.5% 

SDQ prosocial behavior subscale a identifying at-risk status on Q1 GPA 




Cut score of 7 

.35 

.83 

.77 

.22 

.90 

4.3% 

14.9% 

72.8% 

8.1% 

Cut score of 6 

.21 

.88 

.79 

.19 

.89 

2.6% 

10.6% 

77.0% 

9.8% 

Cut score of 5 

.10 

.95 

.85 

.23 

.88 

1.3% 

4.3% 

83.4% 

11.1% 

SDQ peer problems subscale identifying at-risk status on Q4 GPA 





Cut score of 1 

.72 

.46 

.49 

.25 

.87 

8.9% 

47.7% 

40.0% 

3.4% 

Cut score of 2 

.52 

.30 

.68 

.30 

.85 

6.4% 

26.0% 

61.7% 

6.0% 

Cut score of 3 

.31 

.14 

.79 

.36 

.83 

3.8% 

12.3% 

75.3% 

8.5% 


Note. BESS = Behavioral and Emotional Screening System; DBD = Disruptive Behavior Disorder; EFF = diagnostic 
efficiency; FN = false negatives; FP = false positives; GPA = grade point average; NPV = negative predictive value; 
PPV = positive predictive value; Q = quarter; SDQ = Strengths and Difficulties Questionnaire; SEN = sensitivity; 
SPE = specificity; TN = true negatives; TP = true positives. 

a For the prosocial behavior subscale, lower scores are indicative of greater problems in this area. 
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ciency statistics indicated that most parent¬ 
rated subscales offered low to moderate utility 
in identifying children demonstrating at-risk 
academic performance in the first and last 
quarter of the year (i.e., GPA) and SEB prob¬ 
lems in the first quarter (teacher BESS ratings) 
and throughout the year (percent green days). 
However, parent ratings of child SEB prob¬ 
lems provided incremental validity relative to 
the academic screening tool used by the school 
district, accounting for an additional 3% to 
22% of the variance in kindergarten outcomes. 
These data highlight (a) the potential utility of 
obtaining parent report prior to kindergarten 
entry to identify children who are likely to 
struggle to achieve grade-level expectations 
for academic, social, and behavioral function¬ 
ing and (b) the importance of incorporating 
screening results into a more comprehensive, 
multi-informant, and multistep process that is 
linked to a service-delivery framework. In the 
following sections, we discuss our findings in 
relation to previous studies with samples of 
young children and offer recommendations for 
research and practice. 

Identifying and Predicting SEB 
Functioning 

We used the teacher BESS as an indica¬ 
tor of SEB functioning and SW-PBS green 
days as an indicator of behavioral functioning. 
The strongest predictor of these criterion vari¬ 
ables was SDQ behavioral problems, account¬ 
ing for 10% to 14% of variance in these out¬ 
comes beyond that accounted for by the 
KRA-L. Furthermore, the depth of items on 
the DBD hyperactivity-impulsivity and oppo¬ 
sitional defiant behavior subscales predicted 
additional variance in BESS scores, and the 
depth of the DBD inattention subscales pre¬ 
dicted additional variance in green days. The 
SDQ emotional problems subscale also ex¬ 
plained a small incremental proportion on both 
outcomes. Collectively, these results are con¬ 
sistent with evidence that SEB problems, de¬ 
tected in young children at the beginning of 
the year, are associated with peer rejection, 
student-teacher conflict, and impaired class¬ 


room functioning at the end of the year (Fan- 
tuzzo et ah, 2003; Ladd, Birch, & Buhs, 1999). 

The ROC analyses provide important 
information about dichotomous decisions that 
school personnel must make using the screen¬ 
ing results. The AUCs identifying at-risk sta¬ 
tus on the teacher BESS (e.g., behavioral 
problems = .68; 95% Cl [.55, .80]) and green 
days (e.g., prosocial behavior = .71; 95% Cl 
[.61, .81]) were low to moderate yet compa¬ 
rable to findings from other studies that have 
examined utility statistics in a cross-informant 
or cross-method approach. For example, dur¬ 
ing the process of developing the teacher 
BESS, DiStefano and Kamphaus (2007) re¬ 
ported that teacher ratings on the 23-item 
screening tool identified children with a clin¬ 
ical diagnosis (compared with children with¬ 
out a diagnosis) at a rate greater than chance; 
the AUC was .70 (95% Cl [.64, .77]). Other 
studies have found higher AUCs, ranging be¬ 
tween .70 and .80, for both parent and teacher 
SDQs identifying children with clinical diag¬ 
noses (e.g., Stone et al., 2010). This study 
advances the literature by documenting this 
relationship with more distal and ecologically 
valid indicators of risk (i.e., diagnoses are 
often made concurrently and have consider¬ 
able conceptual overlap with screening 
measures). 

The diagnostic utility statistics docu¬ 
ment the trade-offs that have to be made re¬ 
garding false-positive and false-negative rates 
at different cut scores. On the one hand, the 
negative predictive values for most subscales 
are above .80, raising confidence that if the 
child’s screening results are negative, he or 
she is highly unlikely to show at-risk behavior 
during kindergarten. On the other hand, the 
sensitivity and positive predictive values indi¬ 
cate that it is premature to recommend that 
parent ratings on these measures be used in 
isolation or as a first gate in multiple-gating 
procedures because the false-positive rates 
(i.e., inaccurately telling parents that their kin¬ 
dergarten child is at risk) may create undue 
stress for parents and teachers alike and the 
false-negative rates may contribute to unde¬ 
tected academic risk. Future research that fol¬ 
lows children longitudinally across multiple 
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years could better inform the manner in which 
parent ratings can be incorporated into school 
district screening, monitoring, and assessment 
procedures. 

This preliminary evidence that parent 
ratings completed at kindergarten entry can 
predict teacher perceptions and daily behav¬ 
ioral functioning across the entire year repre¬ 
sents a significant contribution to the litera¬ 
ture. Given that disruptive classroom behavior 
is (a) stressful for teachers (Greene et al., 
2002), (b) one of the most common reasons 
that teachers refer children for services 
(Lloyd, Kauffman, Landrum, & Roe, 1991), 
and (c) a significant contributor to teachers 
leaving the profession (Ingersoll, 2001), being 
able to identify such risk prior to school entry 
provides the opportunity to better target chil¬ 
dren for monitoring, further assessment, or 
early intervention. 

Identifying and Predicting Academic 
Functioning 

We used quarterly GPA as the indicator 
of academic functioning. Because the KRA-L 
is designed to assess early literacy skills, it is 
not surprising that KRA-L scores accounted 
for a large proportion of the variance in the 
Quarter 1 GPA (41%) and a much lower por¬ 
tion of the variance in the Quarter 4 GPA 
(23%). Interestingly, the SDQ peer problems 
subscale was the strongest significant predic¬ 
tor of student GPA at the beginning of the year 
and end of the year, accounting for a small but 
perhaps meaningful portion of incremental 
variance. Unlike the KRA-L, the proportion of 
variance explained by the SDQ peer problems 
subscale remained fairly stable across the 
year. This pattern underscores the importance 
of attending to social functioning when at¬ 
tempting to understand and influence aca¬ 
demic functioning. This finding is consistent 
with other studies that have documented the 
positive relationship between peer problems 
and academic problems (DiPerna, Volpe, & 
Elliott, 2001), as well as studies documenting 
the independent and additive contributions of 
peer rejection and diagnostic status on 
academic impairment (Mikami & Hinshaw, 


2006). Although a better understanding of this 
cross-domain relationship is important, the di¬ 
agnostic utility statistics indicate that parent 
ratings of peer relations have limited sensitiv¬ 
ity and positive predictive power. Thus, addi¬ 
tional research is needed to examine other cut 
scores to define at-risk academic performance 
and other indicators of academic functioning 
in this age group. In addition, it could be 
fruitful to explore the sensitivity and specific¬ 
ity of other measures of parent-rated social 
functioning at this age. Our study findings 
suggest that such research has merit. 

Conducting Universal Screening at 
Kindergarten Registration 

Although nearly 60% of children in the 
United States attend preschool or receive cen¬ 
ter-based care (U.S. Census Bureau, 2013), 
systematic screenings prior to kindergarten 
may not be feasible given the diversity of 
infrastructure across such preschool settings. 
The structure of kindergarten registration rep¬ 
resents an opportune context in which to ob¬ 
tain parent-based screening information that 
enhances school professionals’ preparedness 
to meet the unique needs of every child. How¬ 
ever, detecting at-risk status in a young child 
is a sensitive issue. Arguments cautioning 
against early mental health screening include 
the wide variability in normative behavior at 
this developmental stage, the risk of false pos¬ 
itives, the burden placed on schools to address 
the needs of children detected, and the poten¬ 
tial financial and personnel costs. Given the 
low base rate of the behaviors to be detected, 
the sensitivity and positive predictive power 
rates for screening tools identifying cross-in- 
formant or cross-method outcomes may be 
limited. Although these concerns warrant 
careful attention, the studies reviewed in the 
introduction offer important evidence that the 
advantages of early intervention may out¬ 
weigh the disadvantages associated with the 
aforementioned concerns. Furthermore, re¬ 
garding concern about false positives, untow¬ 
ard outcomes that may arise as a function of a 
false-positive identification can be mitigated 
by how detected cases are handled. For exam- 
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pie, researchers and school districts could take 
different actions based on different levels of 
screening scores. Exceeding the lowest cut 
score could simply trigger closer monitoring 
of the student and parent communication, 
whereas exceeding a higher cut score could 
trigger a cross-informant assessment or the 
provision of early intervention in the general 
education environment. 

It is important to recognize that univer¬ 
sal screening does not produce additional 
need; rather, screening simply identifies stu¬ 
dents who might have unmet need. Although 
some investigators may argue that there is no 
harm in waiting a few months to obtain 
teacher report, the results of our study suggest 
that universal screening identifies many of 
those children who are likely to emerge as 
needing assistance (e.g., through current pro¬ 
cedures such as lower grades and referrals to 
the office) but identifies them earlier, offering 
the opportunity for parent-teacher communi¬ 
cation, the development of monitoring sys¬ 
tems, and early intervention. Furthermore, 
given the moderate correlations between par¬ 
ent and teacher ratings, parent screening iden¬ 
tifies a unique subset of children who may not 
be detected by teacher ratings. Lastly, there is 
some evidence that children who are detected 
via a teacher-rated universal screener, but are 
deemed to be false positives because they did 
not qualify for special education, are still im¬ 
paired and in need of intervention because of 
lower scores on measures of intelligence, lan¬ 
guage, and academic achievement as com¬ 
pared with children who did not fail the 
screening test (true negatives; Glascoe, 2001). 

Limitations 

Although the current data are of interest 
to researchers and practitioners, they should 
be considered within their limitations. First, 
the teacher sample is homogeneous regarding 
race and the child sample is homogeneous 
regarding race and socioeconomic status. 
Thus, replication with larger samples and 
more racially and economically diverse sam¬ 
ples is warranted. Second, because teachers 
had access to the results of the parent screen¬ 


ing, such access could have influenced teacher 
ratings or services provided during the year. 
However, because the screening was new to 
the district and systematic procedures for how 
to use the screening results had not been im¬ 
posed, this influence was minimal. Indeed, 
informal interviews with school staff sug¬ 
gested that the reports were used minimally 
and inconsistently across teachers and school 
buildings. This highlights the importance of 
developing and evaluating systematic proce¬ 
dures and accountabilities for using screening 
data. Third, because quarterly grades in kin¬ 
dergarten are typically not based on quizzes, 
tests, and homework assignments, GPA in kin¬ 
dergarten may simply represent another rating 
of teacher’s perception (and possibly bias) of 
student progress toward the academic stan¬ 
dards. Fourth, because psychometric data are 
not available for the KRA-L, the incremental 
validity for parent ratings may differ when 
using a different academic screening tool. 
Lastly, longitudinal data are needed to under¬ 
stand the longer-term outcomes of identified 
children. 

Preliminary Recommendations for 
Practice 

Despite the aforementioned limitations, 
the results suggest that parent report at kinder¬ 
garten entry offers significant, unique, and 
meaningful data about child risk status beyond 
that offered by the school district’s academic 
screening and thus is worthy of inclusion in 
future research and practice. Given the consis¬ 
tency of our findings with the empirical liter¬ 
ature, as well as the consistency of the AUCs 
across multiple domains of functioning, across 
multiple methods of assessment (teacher rat¬ 
ing scales, grades, and daily behavior), and 
across time, preliminary recommendations can 
be made. First, the data support the use of 
either the DBD or the SDQ in future research. 
Given that the SDQ behavioral problems sub¬ 
scale was the strongest predictor of teacher 
BESS scores and SW-PBS green days and that 
the SDQ offers information about social func¬ 
tioning (i.e., peer problems and prosocial be¬ 
havior) that is not offered by the DBD, it may 
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be advantageous to use the SDQ. A similar 
recommendation has been made for teacher 
screening tools because broader screening 
tools (assessing more domains) are more use¬ 
ful than narrower tools that assess one domain 
in predicting later teacher perceptions of the 
student, peer relations, and grades (Flanagan, 
Bierman, Kam, & Conduct Problems Preven¬ 
tion Research Group, 2003). 

Second, these data suggest that school 
personnel should consider how to include par¬ 
ent ratings in their screening procedures. Iden¬ 
tification on such ratings could trigger a mon¬ 
itoring procedure in which identified children 
are reviewed and discussed at intervention as¬ 
sistance team meetings, as well as a process 
for proactive communication with parents be¬ 
fore problems further develop. Then, addi¬ 
tional screening, assessment, and intervention 
services could be titrated over time to address 
problems that persist. Such a process could 
lead to increased efficiency in identification 
and facilitate the provision of early interven¬ 
tion to reduce the cost and intensity of inter¬ 
ventions needed later. 

Lastly, parent rating scales completed at 
kindergarten registration provide a unique op¬ 
portunity to obtain a “preview” of incoming 
students. This information could be used to 
distribute children with varying strengths and 
weaknesses evenly across classrooms or to 
promote better student-teacher fit. Using 
screening data, school staff may be able to 
prevent the development of some negative 
teacher-student relationships or to prevent one 
teacher from being assigned the majority of 
the most demanding students by chance. Fu¬ 
ture studies could also examine the extent to 
which children identified as at risk in the 
spring prior to kindergarten would benefit 
from some type of school readiness program¬ 
ming during the summer prior to kindergarten 
entry. 

Summary 

Obtaining parent ratings during kinder¬ 
garten registration gives school staff a head 
start of several months to assess individual 
student needs and make decisions related to 


monitoring and intervention. Although the 
AUC rates indicate that the parent ratings on 
the DBD and SDQ could not be used as a 
screening tool in isolation, they do offer incre¬ 
mental information on social, behavioral, and 
academic functioning during the kindergarten 
year, accounting for up to 22% of additional 
variance beyond that explained by the aca¬ 
demic screener used by the school district. 
These data provide an important foundation 
for future research examining the utility of 
parent ratings in multistep, multi-informant 
screening and intervention processes. 
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